Convergence Conditions for Random Quantum Circuits 
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Efficient methods for generating pseudo-randomly distributed unitary operators are needed for the 
practical application of Haar distributed random operators in quantum communication and noise 
estimation protocols. We develop a theoretical framework for analyzing pseudo-random ensembles 
generated through a random circuit composition. We prové that the measure over random circuits 
converges exponentially (with increasing circuit length) to the uniform (Haar) measure on the uni- 
tary group, though the rate for uniform convergence must decrease exponentially with the number 
of qubits. We describe how the rate of convergence for test functions associated with speciíic ran- 
domization tasks leads to weaker convergence conditions which may allow efficient random circuit 
constructions. 
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Random unitary operators, consisting of unitary op- 
erators drawn randomly from the invariant (Haar) mea- 
sure on U{D), comprise a powerful resource for quan- 
tum computation and quantum communication. Recent 
work has shown that random unitary operators enable 
efficient protocols for characterizing noise (decoherence) 
on universal quantum processing devices 0> 0, • Noise 
estimation is an important problem for the experimen- 
tal development of coherent quantum control jj, |5j and, 
ultimatcly, fault-tolerant quantum processing via the op- 
timization of error-correction schemes ,6j. Moreover, ran- 
dom unitary operators can be applied to randomize arbi- 
trary quantum state and consequently have proven use- 
ful for a number of quantum communication protocols, 
including approximate quantum encryption [?J, remote 
state preparation || , and the superdense coding of quan- 
tum states . A practical drawback for the above appli- 
cations is that a decomposition of the Haar-distributed 
random operators in terms of one and two qubit gates 
(which is rcquircd to generate a random unitary opera- 
tor on a quantum processor) requires quantum circuits 
of exponential length 0(D 2 log(D) 3 ) |ffj|. Such circuits 
are termed inefficient because the length grows exponen- 
tially with the number of qubits (log 2 (-D)). 

In Ref. 0, famílies of random circuits, i.e., circuits 
that are composed of sequences of quantum gates drawn 
independently and randomly from some universal gate 
set, were proposed as a means of generating a set of 
pseudo-random unitary operators, i.e., a set that is for 
certain practical purposes indistinguishable from a Haar- 
distributed set of random unitary operators. The pos- 
sibility that pseudo-random sets may be generated effi- 
ciently on a quantum computer is suggested from previ- 
ous work in quantum chaos, which has shown that quan- 
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tum chaos models reproduce relevant characteristics of 
random unitary operators even though these Sys- 
tems may be simulated with efficient quantum circuits 
[l2| . Moreover, numerical studies have shown that ran- 
dom circuits of efficient length can reproduce that Haar 
distribution of subsystem purity Q,E1> an d experimental 
evidence indicates that sufficiently random operators can 
be generated with the currently available levels of exper- 
imental control with NMR quantum processors [ij, [lj] . 

In this paper we develop a set of mathematical tools 
that can be applied to analyze measures over random cir- 
cuits. We apply these tools to demonstrate that under 
increasing length of the random circuit the measure over 
random circuits converges uniformly to the Haar measure 
on the group. We show that the rate of convergence (as 
a function of increasing gate depth) is generically expo- 
nential, though the exponent must itself decrease expo- 
nentially with the number of qubits. At the end of this 
paper we describe how specific applications can define 
weaker convergence conditions, leading to the possibility 
that efficient random circuits composed from a discrete 
set of universal gates might be able to adequately mimic 
the Haar measure. 

We now prové that the measure of a product of unitary 
operators, with each factor in the product drawn inde- 
pendently from a measure enjoying support on a subset 
of U (D) that generates the full group (viz, a continuous 
universal gate set) or a dense subset of the full group 
(viz, a discrete universal gate set), converges exponen- 
tially to the constant function with respect to the Haar 
measure on the group. Let G denote the (compact Lie) 
group U(D) with elements g <E G. Let T denote the 
set of probability measures over this group. Suppose the 
elements g\ and gi are drawn at random from the mea- 
sures /i e T and / 2 G T respectively. The composed 
element g = g\ ■ gi-, where • denotes the Standard group 
multiplication, is then distributed according to 

M = (/i * h){g) = í d^Mgh-^Mh), 
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where * denotes the convolution product and dfi(g) de- 
notes the Haar measure on G. The convolution operator 
for m convolution factors is denoted f* m = f * /*( m_1 ). 
Assuming that the initial function / is a probability mea- 
sure then all its convolution powers f* m are also auto- 
matically probability measures, and hcncc have the prop- 
erties, 



V ff , r m (g) > 
d^g)r m {g) = l. 
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We are interested in the convergence properties of the 
sequence of probability distributions f* m as m — > oo. 
More precisely, we would like to prové its convergence to 
the uniform function with respect to the Haar measure 
and obtain an estimate of its rate of convergence. As 
an aside let us first describe the intuition for expecting 
convergence to the Haar measure. We note that only the 
characters of the group are stable under convolution . 
Therefore, if f* m converges at all then it must converge 
to a character of the group. Moreover it must converge 
to the Haar measure because this is the only character 
which is everywhere non-negative. The hard part is now 
to determine under what conditions the sequence f* m 
actually converges. 

An important tool in the study of convolution prod- 
ucts is the Fourier decomposition. Indeed we will make 
use of the fact that the convolution product turns into 
a simple Standard product for the Fourier components. 
In the case of compact Lie groups, a Fourier representa- 
tion of functions is provided by the Peter-Weyl theorem 
(see e.g. 0). Let G = {T s } be the set of non-equivalent 
irreducible unitary representations of G. We order the ir- 
reducible representations s by increasing dimensionality 
d s , where the trivial representation has dimensionality 
do = 1. The functions \fd~ s D s - k (taking g — ► C) for s e G 



jk 

and 1 < j, k < d s , where D^ k (g) are the matrix elements 
of T s , form a complete orthonormal set for the Hilbcrt 
space L 2 (G) of square-integrable functions on G with re- 
spect to the Haar measure. The orthogonality relations 
read: 



M9)D s tJ (9)D^ n (g) 



(2) 



where 5 SS is 1 if T s and T s are equivalent and zero other- 
wise. Then, any function / 6 L 2 (G) can be decomposed 
as, 



/(*) = £ E f%D s jk (9), 



with Fourier coefRcients 



Ijk 



d»(g) fia) D'M, 
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and in particular, 

We have the following integral identity: 

d^g)\f(g)\ 2 = E^tr((r)t/< 

seò 



< oo 



(•5) 



(6) 



Finally, the normalizcd constant function (with respect 
to the Haar measure) has Fourier coefRcients: 



A s=0 = i, 



~s>l n 



(7) 



In general the Fourier transform of the convolution 
product has a coefRcient matrix for each irreducible rep- 
resentation of the group given by the product of the co- 
efRcient matrices from the same irreducible representa- 
tion of the Fourier transforms of the original convolved 
functions. That is, given two functions the Fourier 
transform of their convolution product is given as: 



d» 
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Therefore the convoluted powers of the initial probability 
distribution reads: 




(9) 



G 



where (f* m ) s denotes the d s x d s matrix of coefRcients 
appearing in the Fourier representation of f* m and {f s ) m 
denotes the m'th power of the d s x d s matrix of coefR- 
cients appearing in the Fourier representation of the ini- 
tial distribution /. The normalization condition implies 

We assume that the initial measure / is continuous 
and enjoys support only on some (small) subset of G 
that generates the full group G. This subset might have 
very small measure compared to the full group; one ex- 
ample is the continuous gate set consisting of all single 
qubit rotations coupled with a CNOT gate between all 
qubit pairs (later we relax this assumption and consider 
convergence conditions for discrete and finite universal 
gate sets). We have the following lemma: the matrix f s 
has norm strictly less than d s for s > 0. To prové this 
we consider the usual vector norm, \x\2 = \J (x\x) , and 
observe that for all vectors x in the s representation, we 
have 0: 

\fx\2 = \d s J d^g) f{g)D (s \g)x\ 2 

G 

< d s J d l· i{g)\f{g)D (s \g)x 

G 

< d s J d^fig) \\D iS \g)\\ \x\ 2 =d s \x\ 2 

G 
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where we have introduced the norm, ||D|| = 
max^^o l-D^h/l^h, and used that ||D|| = 1 since D(g) 
is unitary. Equality holds if and only if D{g)x = £,{g)y 
is true for all g for which f(g) > 0, where £(<?) £ C. 
Since any g £ G can be generated by the support of /, 
D(g)x — £,(g)y must hold also for all g £ G. This im- 
plies we have a 1-D representation of G cmbcddcd in the 
irreducible representation s. Hence the equality is not 
saturated unless s is the trivial (identity) representation. 
It follows that, when s ^ 0, i.e. for all d s > 1, we have 
Víc, \f s x\ < d s \x\, hence 



1 " \x\ 2 



max \f s x\ 2 < d s . (10) 



Our proof is not concluded: this bound does not 
guarantee uniform convergence to the uniform measure 
(given by Eqs. Q) because the Fourier representation 
for genèric functions involves an infinite sum, and hence 
the eigenvalues of very "high frequency" Fourier modes 
can a priori come arbitrarily close to 1. Nevertheless, a 
flrst remark is that polynomial functions / (technically 
polynomials in the matrix elements of g) are described by 
a finite sum in the Peter-Weyl decomposition. For such 
a case, denoting by S the "frequency cutoff" from the 
maximal representation label for the given polynomial, 
we obtain the bound 



|(/* m ni <d a a r - 



with 




< 1, 



and, consequently, a proof of the exponential convergence 
of the convoluted powers towards the uniform measure. 

For / more generally any continuous function, one can 
repeat a similar analysis which guarantees a uniform ap- 
proximation using only a finite Fourier sum. Indeed for 
/ continuous and any e > there exists a finite N e (in- 
dependent of g) such that, for all g £ G [ïü)]. 



/(í)-EE4w 

S = l J,fc=l 



< e. 



(11) 



Let us call /at e the truncated function with the represen- 
tation cut-off N e . Then for all g £ G and m > 2, we use 
the triangle inequality to write: 

l/* m (s)-i| < \r m (g) - r N ™(g)\ + \f^{g) - 1| 

< \r m (g)-r N 7(9)\ 

Ne d s 

+ E E dl- m ((h m )jkD° k (g) 

s>l j,k=l 

It is straightforward to bound the first term using the 



fact that /jv £ * Jn, = f * Ín c ■ For all g £ G we have, 

\r m (g) - f£{g)\ = 1/ * {f (m - 1] - f N { r 1] K9)\ 

^ / d^ifigh^mr^-v - fiï™-V)(h)\ 

oo 

< e, 



< 
< 
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using the fact that / is a positive function with integral 
equal to 1. We bound the second term with the help of 
the inequality: 

tr ((h m D s ( g )) | < d s \\(h m \\ < d s \\hr, 

where we use that the vectors of the unitary matrix D s (g) 
have norm one. It directly follows that: 

\r£{g) -i|<<E d - 

S>1 

where we have defined a e = maxi <s< Ar e (||/ s ||/íi ;s ). As 
we have shown above, for all non-trivial s, ||/ s || < d s so 
that a e < 1. Therefore there exists a integer M, which 
can be chosen larger than N e , such that |/™ — l|oo < e 
for all m > M. Then it is obvious that for all m > M, 
we have bounded: 



II/* 



II 



< 2e. 



This concludes the proof that f* m converges uniformly 
to the constant probability measure on the group for 
any continuous probability measure. More generally, we 
claim that the convergence is exponential for any f £ L 2 . 
Because of Eq. I© we know that tr((f s y f s )/d s goes to 

when s goes to oo. Specifically, ||/ s || < J tr((/ s )í/ s ), 

and therefore we have H/^l/v^ ~> 0, so we don't need 
to worry about the norm of f s /d s getting close to 1 when 
s — > oo. More precisely, there exists S$ £ N such that 
\\f s \\/y/dZ < 8 < 1 for all s > Sg. Then for all s we have 
the bound 



-(™-2)|| js 



s>() 



0<s<S s s>S 5 



(m/2-2) 



where we have defined, 



as = max 

s<Ss 




< 1, 



As long as m > 6 the sum over s in the second term 
converges. Indeed, as explained in \íl\ . the irreducible 
representations of U(D) are usually labelled by a couple 
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of integers (k, l) and their dimension is given in terms of 
binomial coeficients: 



AD) 
l k,l 



k + l + D-1 
D - 1 



C. 



Cl 



k+D-2^l+D-2 



It is then straightforward to check that the dimensions 
grow sufficiently rapidly with s to make the sum over s 
converge. The exponential a m defines the convergence 
rate of the convoluted powers of / to the uniform prob- 
ability measure. 

We have shown that the measure over random circuits 
converges to the Haar measure and moreover that the 
rate of convergence (with increasing circuit length m) 
is exponential. However the exponent will generally de- 
pend on the Hilbert space dimension D. It is clear that 
for uniform convergence the exponent must decrease at 
least as rapidly as D 2 \og(D) 3 . This follows from the 
fact that 0(D 2 \og(D) 3 ) gates are required to generate 
all the elements of U(D) from a fixcd universal gate set 
0. However, the requirement of uniform convergence 
we have considered is much stronger than necessary for 
any practical application. Indeed the practical statis- 
tical distinguishability will be limited by practical con- 
straints, such as bounded computational resources or fi- 
nite sampling from the distribution. In particular we 
are concerned with convergence with respect to some op- 
erationally restricted class of test functions. Hence we 
demand only that f* m converge to the Haar measure for 
the "weak topology" , i.e. J G d^i(g) f* m (g)(f>(g) converges 
to J G dfj,(g) 4>(g) for appropriate (continuous) test func- 
tions </>. A first example of an operationally motivated 
test function is the fidelity loss under the motion reversal 
of a random unitary 0). Another example comes from 
the important case of the distinguishability of quantum 
states evolved under unitary operators drawn from dif- 
ferent distributions. Both criteria lead to test functions 
<f> which are given by polynomials in the matrix elements 
of the fundamental irreducible (unitary) representation 
D s=1 (g) of g, and the finite degree of the polynomial 
fixes a Fourier cutoff. Hence the convergence is automat- 
ically an exponential and the rate of convergence may 
lead to efficient (sealable) random circuit constructions. 
Convergence for the weak topology (i.e., for polynomial 
test functions) also allows us to extend our analysis to 
arbitrary distributions / € L , such as finite sums of 5 
functions. The same technique applies and we only need 
to require that / has support on a discrete universal gate 
set. The argument leading to Eq. still holds and ex- 
ponential convergence follows for the weak topology, as 
explained above. 

In general then the rate of convergence will depend on 
the convergence condition specified from an appropriate 
test function and on the initial probability distribution 
/. The initial distribution / can be modelled in a variety 
of ways. For example, if we start with a Gaussian packet, 
then its width will simply increase linearly with the power 



of the convolution, eventually converging to the uniform 
measure. The exact convergence rate is then determined 
directly from the maximum matrix norm of the Fourier 
components of the initial distribution, where the maxi- 
mum is taken only over those Fourier components below 
the cutoff determined from the test function. The appro- 
priate test function (and Fourier cutoff) will generally 
depend on the specific application. Indeed the numerical 
analysis of Refs. 0, El indicates that for a test function 
given by the subsystem purity the rate of the exponen- 
tial convergence is asymptotically independent of the di- 
mension D, suggesting that random circuits can generate 
Haar-distributed subsystem purity efficiently. 
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